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Abstract 

This paper investigates first passage percolation (FPP) on inhomo- 
geneous random graphs (IHRG). The random graph model G{n, k) we 
first study is the so-called finite type case of the general model intro- 
duced by Bollobas, Janson and Riordan in [7]. Each edge of G{n, k) is 
given an independent exponential edge weight with rate 1. Our main 
assumption is that the average number of neighbors An + 1 of each ver- 
tex is independent of its type. We consider the cases where — )■ A is 
finite or infinite. Afterwards the general model is also considered. 

The paper can be considered a generalization of written by 
Bhamidi, van der Hofstad and Hooghimstra, where FPP is explored on 
the Erdos-Renyi random graphs, a special case where all the vertices 
are of the same type. We find analogous results for the minimal weight 
of the path between uniformly chosen vertices in the giant component 
and for the hopcount, i.e. the number of edges on this minimal weight 
path. The proofs make use of a direct relation between FPP on the 
IHRG and thinned continuous-time multi-type branching processes. 
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1 Introduction 



The study of random graphs has gained increasing attention the past decade 
due to the fact that large complex networks (such as the World Wide Web) 
have arisen in different fields of study. Much earlier, in the 1960's Pal Erdos 
and Alfred Renyi were the first to study random graphs. In their seminal 
model each edge of a graph on n vertices is present independently with prob- 
ability p, these are the Erdos- Renyi (ER) random graphs. Putting p = c/n, 
it is a classical result (see [TO]) that a giant component emerges (called the 
supercritical case) if and only if the constant c > 1. The Erdos- Renyi graphs 
are homogeneous in the sense that the edges are present with equal proba- 
bilities. However many real life networks do not show this property. 

More recently in [7j BoUobas, Janson and Riordan introduced a general 
inhomogeneous random graph model, which also includes the ER random 
graphs as a special case. The vertices are assigned different types and the edge 
probabilities depend on these types given by a 'kernel' function. Amongst 
other results, they characterized the supercritical regime. In this regime they 
also proved what the typical graph distance is between two randomly chosen 
vertices of the giant component. Typical distances have also been studied in 
other models, see for example |H], [13]. A possible generalization is to give 
the edges different edge weights. 

This leads us to the problem of first passage percolation (FPP) in some 
random environment. Let the environment be some random graph and give 
each edge a random edge weight, typically independent and identically dis- 
tributed (i.i.d.) positive random variables. Now think of fluid percolating 
through the graph from some source at a constant rate. As the environment 
grows one is interested in the asymptotics of various quantities of the flow. 

In [4J Bhamidi, van der Hofstad and Hooghiemstra analyzed FPP on the 
ER random graph with i.i.d. exponential edge weights. They proved that 
the hopcount, i.e. the number of edges on the shortest-weight path between 
two randomly chosen vertices in the giant component, follows a central limit 
theorem. Furthermore, they show convergence in distribution for the weight 
of the shortest-weight path. Both of these quantities scale as logn (see 
[U Theorems 2.1 and 2.2]). Related results for FPP with exponential edge 
weights can be found in [2], [3], [13] . 

This paper aims to continue this line of work by investigating FPP on 
inhomogeneous random graphs with i.i.d. exponential edge weights. In the 
finite-type case of the general model of [7] we show analogous results to that 
of [4j for the hopcount and the weight of the shortest-weight path. Afterwards 
we show how these results can be generalized to the original model of [7]. Our 
main assumption is that the average number of neighbors of each vertex is 
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independent of its type. We consider the cases when the average degree tends 
to a finite hmit and also when it tends to infinity. Our arguments follow the 
outline of |1] and [7] , however there are new challenges to face. It is standard 
to couple the exploration process of the graph to a branching process (BP). 
With the addition of types and exponential edge weights it is not enough 
to work in discrete time, we rely on the theory of multi-type continuous 
time branching processes (see [1]). Furthermore, because of the types we 
lose independence in the BP, which leads to the use of more complicated 
techniques. 



2 The model and the main results 

In this section we define the random graph model G{n, k) which is the finite- 
type case of the general model introduced in [71 Example 4.3]. The finite- type 
case has already been studied by Soderberg [12]. Each vertex is assigned a 
type from a finite set and the vertices are connected with different probabil- 
ities depending on their types given by a kernel matrix. After introducing 
some standard notation we state the main results concerning the hopcount on 
G{n, k) and the weight of the shortest-weight path under different conditions 
for the average degree. 

Let 5 = {1, 2, . . . , r} be a finite set equipped with a probability measure 
11, i.e. /ij := > and /i • 1 = 1, where 1 = (1, 1, ... , 1)"^ . Each vertex 
i is assigned an element from S, referred to as its type and denoted by t{i). 
For each n we have a deterministic or random sample of n points from S. 
Say rit vertices are of type t, so ^^g^ = n. We assume that the samples 
are taken in such a way that 

nt p 



n 



fit holds for every t E S. (1) 



For example if the samples are independent and identically distributed (i.i.d.) 
with distribution /i, then ([T]) holds by the law of large numbers. 

Up until Section [6] the kernel k = {K,{s,t))l^^^ will denote a symmetric 
non-negative rxr matrix. The natural interpretation of k, is that it measures 
the density of edges. More precisely, the random graph G{n, k) on vertex set 
[n] = {1,2, ... ,n} is defined as follows. Two vertices i and j (with i ^ j) are 
connected with probability 

f K(t(i),t(j)) ] 
p.«*0):=min|^iAi:^,l| (2) 

independently for each possible pair i ^ j. k, has only finite different values, 
so for n large enough (n > maxn) the probability simplifies to K.{t{i),t{j))/n. 
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We can see that the Erdos-Renyi random graph is indeed just a very special 
case of this model. Namely, let S consist of a single point and k = c. Then 
the probabilities Pij are all just simply c/n (for n > c). 

Finally, independently of all randomness each edge e is given an edge 
weight Eg, which is exponentially distributed with rate 1. During many of our 
arguments, we will make use of the memoryless property of the exponential 
distribution. 

The following figure shows an example with |iS| = 2 and n = 7. Start from 
the complete graph and perform different coin tosses on each edge according 
to ^ indicated by the various colors, the resulting graph is G{n, n) (without 
the edge weights). Here nodes {1,2,3,4} are of type 1 and {5,6,7} are of 
type 2, the edge probabilities pn = 2/3 = P22 and pi2 = P21 = 1/2. 



Complete graph G(n, k) 

Remark 2.1. 

(a) The general model in detail will be considered in Section^ In many 
cases the results in |^ are first proved for finite-type kernels and then 
approximation arguments are used for general kernels. This is why it is 
important to first understand FPP in the finite-type case. 

(h) We can assume that jii > Vz. An argument will he given in Section^ 

(c) We also assume that k, is irreducible. Basically, this means that the 
vertices ofG{n,K,) can't be split into two parts so that the probability of 
an edge from one part to the other is zero. The formal definition is given 



in Definition 6.4 



It will be useful for us to use the following notation: 

Xst := K{s,t)fit, (3) 
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for all s,t G S. Notice that Xst has very literal meaning. For any given vertex 
of type s the number of its neighbors of type t are binomially distributed with 
parameters rit — 5st and k{s, t)/n {5st = 1 if and only li s = t). So from ([T]) in 
the limit we get that Xst gives us the average number of type t neighbors of a 
type s vertex. Later this will allow us to approximate with suitable Poisson 
distributions. From here we construct the matrix 



A 



i All — 1 Ai2 ■ ■ ■ \\r \ 

A2I A22 — 1 • • • A2r 

y Afi Ar2 ■ ■ ■ Afr ~ V 



(4) 



Principal Assumption 2.2. We will assume that the row sums of the ma- 
trix A are the same and equal to X > 0. 

This means that in the limit the average degree is independent of the 
types. This is referred to as the homogeneous case in [7:, Example 4.6]. In 
this case the global behavior of G{n, k) in the limit is the same as of the 
ER random graph, but the local behavior can be quite different. The A > 
condition is necessary and sufficient for a giant component to emerge in 
G{n,K) [7!, Theorem 3.1]. 

Let us introduce some standard notation. Let Bin(n,p), Poi(A), Exp(/i) 
respectively denote a binomial, a Poisson and an exponential random variable 
with the parameters having their usual meaning. Convergence almost surely, 
in distribution and in probability are denoted by respectively. 
A sequence of events holds with high probability (whp), if it holds with 
probability tending to 1 as n — )■ 00. We use the Landau symbols O and 
with their usual meaning. We say that a sequence of random variables A„ 
satisfies A„ = Op(6„) if A„/6„ is tight (i.e. We > 03M : P (A„ > M6„) < e) 

or Xn = op(6„) if Xn/bn — )■ 0. Now we turn to the main results. 

We investigate the weight and the number of edges on the shortest-weight 
path between two vertices x and y. The following two theorems describe the 
asymptotic behavior of the weight of the shortest-weight path and the number 
of edges on this path, the hopcount. 



Theorem 2.3 (CLT for hopcount). Under Assumption 2.2 the hopcount 
Hn between two uniformly chosen vertices, conditioned on being connected, 
satisfies a central limit theorem of the form 



logn 

A 
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where Z is a standard normal variable. 

Next we consider the asymptotics of the minimal weight path. 

Theorem 2.4 (Limit distribution for minimal weight path). Under Assump- 
tion 2.2 there exists a non- degenerate real valued random variable W , such 
that the minimal weight Wn between two uniformly chosen vertices, condi- 
tioned on being connected, satisfies 

Wn - log n W. 
X 

The distribution of W can be determined precisely from multi-type branch- 
ing process quantities, which we will describe later. 

Remark 2.5 (Joint convergence). The proof will show that the above con- 
vergence hold jointly with independent limits. 

A possible extension is to have k depend on n. In t his case we have a 

with A„ as the sum 
lowing form: 



2.2 



sequence of matrices A^, each satisfying Assumption 
of the rows. Then we can extend our theorems in the fo^ 

Corollary 2.6. //lim„_>oo A„ = A < oo then for the hopcount we have 

^ 

logn 

where Z is a standard normal variable, and for the weight of the FPP 

Wn - ^ log n W. 
A 

It is interesting to compare these results with its counterpart where there 
are no edge weights. Then of course Gn = Wn- Theorem 3.14 of [7] in 
our context says that in this case the typical distance between two vertices 
in the giant component still scales as logn, but with a different constant: 
l/log(A + 1). Comparing the constants, we see that 

1 1 A + 1 

< — < — =— . 

A log(l + A) A 

This shows that by adding edge weights the structure of the graph changes. 
Along the shortest-weight path more vertices are visited (Hn) than on the 
path with the least number of vertices. At the same time the weight of the 
path (Wn) becomes smaller than the number of edges on the path with the 
least number of vertices. The figure below illustrates this, where the red path 
is the shortest-weight path while the green is the one with the least number 
of vertices. 
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Similarly as in ^ we also investigate the dense regime, where lim A„ = oo, 
i.e. the average degree tends to infinity. In this case whp any two vertices 
are connected, so the giant component contains n(l — o(l)) vertices (see [7]). 
Again comparing with the counterpart without edge weights, the change 
in the graph structure is even more significant. Without edge weights the 
graph is ultra small, meaning that graph distances between uniformly chosen 
vertices are o(logn). However with the addition of edge weights the following 
theorem states that even the magnitudes do not coincide. We get the same 
type of behavior as observed in the sparse setting, which means that on the 
shortest-weight path many more vertices are visited. 



Theorem 2.7 (Dense setting). Under Assumption 
then we have 



2.2 and if lim A„ = oo, 



(a) 



Hn — 



An + 1 
An 



logn 



yiog 



n 



where Z is a standard normal variable and we can substitute ^^sii by 1 

An 



if and only if A„/ y/\ogn — )■ oo. 



(b) 



KWn - logn 



where in distribution W is equal to the sum of independent random 
variables Yi + Y2 — Y^, with Yi and Y2 standard Gumbel random vari- 
ables and Y3 another Gumbel random variable with distribution function 
exp ( — e~^^') . 
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2.1 Sketch of proof and structure of the paper 

Roughly, the proof will go as follows. It is standard to relate the exploration 
process of the neighborhood of a vertex in a random graph to a branching 
process (BP). Since the number of neighbors of a vertex in the inhomogeneous 
random graph (IHRG) is binomially distributed it is practical to approximate 
the exploration by a BP with Poisson offspring distribution. 

• Section [3] introduces the continuous-time multi-type branching process 
that will be related to an exploration process of the giant component 
of G(n, k). To keep the paper self-contained, the section quotes results 
concerning various quantities of branching processes that will be of use 
in our arguments. The section is finished with the proof of Theorem 



3.4, which is the analog of the main results in the Poisson BP setting. 



In order to carry through the results to the IHRG, first we need to couple 
the Poisson BP with the BP coming directly from the IHRG, i.e. where the 
offspring distribution is the sum of binomial distributions, with parameters 
depending on the type of the vertex. Secondly, we need to couple the BP to 
the exploration, so we need to identify the vertices of the BP and the vertices 
of the IHRG with one another with high probability. To achieve this marks 
are assigned to the vertices according to their type in such a way that we 
can rule out loop and multiple edges in the IHRG. To deal with the problem 
of cycles in the graph, the notion of thinning is introduced on branching 
processes. This way we get an embedding of the BP into the IHRG. 



• In Section |4] this crucial connection is made between the model G{n, k) 
of Section |2] and the branching process of Section [3] The coupling 
argument, marked branching processes and the notion of thinning are 
dealt with in separate subsections. 

It only remains to find the shortest-weight path between two vertices x 
and y in the giant component of the IHRG. Think of fiuid emanating from 
these two vertices simultaneously at rate 1. Intuitively it is clear that when 
these two fiows collide, then the shortest-weight path between x and y has 
been found. 

The random time of the collision will be referred to as the connection 
time Cxy. In the figure below, the vertices and Vy denote the last wetted 
vertices (along the shortest path) by the fiows from x and y respectively 
before time C^y 
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Let G{vx) denote the number of edges between the source x and vertex Vx 
and A{vx) the weight of this path. Then for the hopcount Hn{x,y) and for 
the weight Wn{x,y) of the shortest-weight path we get that 

Hnix,y) = G{vx) + G{vy) + 1, 
Wn{x, y) = A{vx) + A{vy) + Exp{l). 

Thus it is crucial to determine when the connection actually happens. The 
difficulty is that we can't let the fluid flow continuously, but only in discrete 
steps as new vertices are wetted by the flow. Intuitively one could argue that 
we could also get the shortest path by delaying one of the flows or initiating 
only one flow. However, we will see that the proof only works when the two 
flows are about the same size. 

• The final subsection of Section |4] analyzes the connection time rigor- 
ously. 

• After the preparations of Sections |3] and |4] we prove the main results 
for the finite-type case model in Section [5j 

• Section [6] deals with the general model of [7]. After necessary definitions 
and a discussion of approximating general kernels, the main results of 
Section [ij when A < oo, are generalized to this setting. 

3 A branching process 

Analogous to the use of the single-type Poisson branching process (BP) in 
the analysis of the ER random graph, another BP arises naturally in the 
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model G{n, k). Namely, when we explore the neighborhood of a vertex x in 
the inhomogeneous graph with edge weights, the types of vertices determine 
the distribution of the number of neighbors, leading to a multi-type BP. 
We can think of the neighboring vertices being explored at time t when the 
distance from x is less than t. Thus, the exploration can be considered as a 
continuous time process. As the edge weights are exponentially distributed, 
the memoryless property of it implies that it is actually Markovian and can 
be modeled by a continuous time BP known from the literature (see [T]). 

Since the number of type t neighbors of a vertex of type s follows the 
law of Bin(ni — Sst, i^{s,t)/n), we can well-approximate this by a Poisson 
distribution with parameter k{s, t)fit- We will later see that the error arising 
from this approximation is negligible. Thus, in the branching process when 
a particle of type s & S dies, it gives birth to a Poisson number of children 
of type t with parameter 

Xst ■= K{s,t)ij,t, 

for all t G iS. From here we construct the same matrix A as in Q. In this 
BP setting, the row sum (Al)^ gives the expected value of the change in the 
number of alive individuals after the death of a particle of type s. We can 



restate Assumption 2.2 the following way in this setting: 



Assumption 3.1. We shall assume that the expected value of the children of 
a particle doesn't depend on its type, i. e. the sum of each row of A is equal. 
Let A = (v41)s > denote this constant. 

The A > condition guarantees that with strictly positive probability the 
BP survives, see P, Chapter V.3]. The survival probability can be determined 



from the fixpoint equation p = 1 — exp { — {A + I) p} , and assumption 3.1 
implies that pt = p for all types t. For a proof see [2, Chapter 5.]. Making 
use of the properties of Poisson distributions and ignoring the types, under 



Assumption |3.1 we can see that the number of children of each particle has 



a Poi(A + 1) distribution. So in distribution this is equal to a single-type 
branching process with offspring distribution Poi(A + 1). We now formally 
construct the branching process. 

Let Di, D2, ... be a sequence of non-negative i.i.d. random variables with 
distribution Poi(A + 1) and define the sum Si := Dj — — !)• 

Contruction 3.2. 

1. Start with the root (1st vertex) which dies immediately giving birth to 
Di alive offspring. 

2. Each alive offspring lives for an Exp(l) amount of time, independent 
of all other randomness involved. 
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3. When the i-th vertex dies it leaves behind Di alive offspring. 

To be precise, conditioned on type t{i) of the i-th vertex, each Di is the 
sum of r different independent Poisson random variables with parameters 
given by the t(i)-th row oi A + I . The types are assigned to each new 
offspring accordingly. In this context we can identify Si as the number of 
alive vertices after the i-th death. Let Sf^ be the number of alive vertices 
of type t after the i-th death. The memoryless property of the exponential 
distribution and the fact that choosing the minimum out of i.i.d. Exp(l) 
distributions is uniformly distributed implies that looking at the process at 
the split times of the particles gives the following equivalent description of 
Construction 13.21 

Contruction 3.3 (Discrete-time reformulation). 

1. At step 1, start with one alive vertex, the root, dying immediately and 
giving birth to Di alive offspring. 

2. At each step i, pick one of the alive vertices at random, this vertex dies 
giving birth to Di children. 

Let Ti,T2, ... be the time spacings between the i-th and i + 1-th split. 
Since Tj conditioned on Si is the minimum of Si exponential variables, it 
is exponential with rate Si. Denote by Am = Ti + T2 + . . . + Tm the time 
of the m + 1-th split. Thus A^ expresses the distance of the m-th closest 
vertex from the root. We denote the graph distance, called the generation of 
this vertex from the root by Gm- By the above mentioned property that the 
minimum is uniform among the 5*^-1 alive vertices, Gm equals in distribution 
to the graph distance between the root and a uniformly chosen vertex among 
all alive vertices at step m. 

Hence Am gives the distribution of the weight of the shortest-weight path 
between the root and the m-th chosen vertex, while Gm gives the hopcount on 
this path. Thus it is of crucial importance for us to understand the behavior 
of these two quantities. With some non-zero probability it is possible that 
Si = for some 1 < i < m, i.e. the BP dies out. We will always condition on 
Si > for all 1 < i < m, which has strictly positive probability because we 
constructed a supercritical BP. With this conditioning we have the following 
asymptotics: 

Theorem 3.4 (Result for BP conditioned on survival). Conditioned on Si > 
for all 1 < i < m, the following hold for m — )■ oo.- 



A+l 



logm 




'm 



d 
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where Z is a standard normal variable and 



- ilogm 
A 



-ilog (hw 

A \ A 



(6) 



The distribution of W can be determined from the hmit behavior of the 
BP, see next subsection. 

The rigorous proof is given in Subsection 3.2[ furthermore, simulations 
also confirm the assertion of ([s]). Figures [l] and 2] show the empirical distribu- 
tion of Gmi where the two multi-type BPs were generated from two different 
kernels. In each case the BP was simulated until the m = 10000-th split. 



Case 1 



Case 2 




A+l 



i- logm 

A 



13.815 



A + I 
'1.1 0.6 0.8' 
0.4 1.3 0.8 
,0.6 0.7 1.2, 



A+l 



i- logm 

A 



15.351 



In each case we can clearly make out the nice bell-shape of the normal dis- 
tributions which peak around the expected value of log m. 



Number of aJive m each geaeratiiMi 



1; 20 



Number of aJive m each getLeration 



Figure 1: Case 1 



Figure 2: Case 2 



3.1 Asymptotics of multi-type BP quantities 

In this subsection we quote results from [H Chapter V] in the context of our 
branching process. Let us recall our notations. Sm = SJLi Dj — (m — 1) 
denotes the number of alive vertices after the m-th split (the split of the root 
is the first). Of the Sm vertices Sm are of type t. Am denotes the time of 
the m-th split. It is also important to keep track of the types of the vertices 
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that split. Let Nm denote the number of sphts of type t vertices among the 
first m sphts. We need a further assumption to state theorems about the 
asymptotics of these quantities. 

Assumption 3.5. Assume that the matrix A is irreducible, meaning that 
there exists a s.t. A^'-'' has strictly positive entries. 

Then, by the Perron Frobenius theorem we know that A is the lead- 
ing eigenvalue of A and it has associated nonnegative left eigenvector vr = 
7r2, . . . , TT-r) normalized so that vrl = 1. For Poi(As() random variables ^st 

E {^st log^st) < oo holds for all s,t e S. 

On the set of non-extinction there must be an infinite number of splits, so 
Am — > OO as m — )■ cxD. The following hold: 

Theorem 3.6 (Asymptotics of BP quantities, [1]). 

(i) On the set of non- extinction as m oo 

yOm ,Om ) , , 

hm ^ ^ = IT a.s. , [7) 

Am 

(a) and similarly 

1 iVm , . . . , JVm ) a.s. 

> vr. (8) 

m 

(Hi) me~'^^™ \~^W. 

(iv) Am — (l/A)logm converges to a finite limit — A~^ log(A~"'^iy) on a set 
of positive probability and P {W > 0) > 0. (a bit stronger then the 
previous ) 

The distribution of W is given by the moment-generating functions (pt{x) = 
Et (e~^^) , that satisfy the equation 

where (pix) = {ipi{x), . . . ,ipr{x)) and /^*^ is the r variable generating function 
of the offspring distribution of a type t vertex. W has an atom at zero of 
size 1 — p, i.e. the extinction probability of the BP. The random variable W 
in ^ is equal in distribution to (14^ > 0) (details in [H Chapter V. 7.5]). 
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Consequences of Theorem 



3.6 



1 



m 



-Sm — > A, and 



0. 



TXt a.s. 



(9) 



Note that here we did not use Assumption 3A_ Estimates can also be given 
for the magnitude of the error terms in (|9). Depending on the size of the 
eigenvalue of A with the second largest real part, one can properly normalize 
Sj — Xj and 5*]*^ — Att^j such that in distribution it tends to a well-defined 
normal random variable. For details we refer to [llj Theorems 3.22-24.]. We 
will later make use of this in some of our arguments. 



3.2 Proof of Theorem D 



Our principal assumption (2.2) implies that the increments of the number of 
alive individuals in the branching process are independent random variables 
with distribution Poi(A -|- 1) — 1. Thus, basically the same proof works as 
the one given in [S]. For the readers convenience, we reconstruct the main 
steps in our setting. Recall that Am gives the distribution of the weight of 
the shortest-weight path between the root and the m-th chosen vertex, while 
Gm gives the hopcount on this path. We start off by quoting from [3] and 
[5] a fundamental result identifying the distributions of Gm and Am as sums 



of independent random variables. Consider Construction |3.3| throughout the 
subsection. 

Proposition 3.7 (Distribution of hopcount and FPP). At step m choose an 
alive vertex uniformly at random from all vertices alive at this time. Then 

(a) the generation of the m-th chosen vertex satisfies 

m 



where /i,/2, . . . are independent Bernoulli random variables with success 
probability P (Jj = 1 | Di, . . . , Dij = Di/ Si. 

(b) the weight of the shortest- weight path between the root and the m-th cho- 
sen vertex satisfies 

m 

Am = Ti + T2 + ...+Tm = Y,Ei/S„ (11) 

i=l 

where Ei,E2, . . . are i.i.d. Exp(l) random variables. 
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Proof. For an analytic proof of (10) see [5] and a new probabilistic proof 
can be found in [3l Proposition 4.2]. We already argued that Tj is exponen- 
tially distributed with rate Si, from here we immediately get part (b), since 

Exp(n) = Ei/n. □ 
From part (a) we see that 

™ D 

E {Gm I ^1, • • • , D^) =y2^' 

i=i 

y^r{G^\D„...,D^)=f2^(^-j-)- 

i=l * \ * / 

Shortly we will see the right magnitude of the two right hand sides. To prove 
([5]) we show that the standardization 

G^ = (12) 



Eti ft (1 



Si 



converges in distribution to a normal random variable Z. This will be done 
by showing that the Wasserstein distance between the distribution of Gm 
and that of Z goes to 0. This is enough because the Wasserstein metric on 
the space of probability measures induces the same topology as the topology 
of weak convergence ([9l Theorem 11.3.3]). The Wasserstein metric on the 
space of probability measures is defined by 



Wass(/ii, /X2) = sup 



gdfii - / gdfi2 



: g is 1-Lipschitz and bounded 



The following lemma [H Lemma 3.4] gives an upper bound on the Wasserstein 
distance of the law of Gm and of Z, which we quote without proof. 

Lemma 3.8. Let Hm denote the distribution of Gm o,nd n the distribution of 
a standard normal random variable. Then with the abbreviation pi = Di/Si 

Q 

Weiss{fim,p) < 



If the process dies out, then the convergence in (12) does not make 
sense, so we still have to deal with the technical issue of conditioning on 
survival. Unfortunately, conditioning on survival removes the independence 
of the increments of {Sj}'^i. Note that the process {Sj}^i till extinction 
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is equal in distribution to a random walk on Z with independent increments 
Dj — 1 Poi(A + 1) — 1. To overcome the problem of conditioning, we couple 
the process conditioned on survival to a process conditioned to stay positive 
only in the first Um = o(loglogm) steps and then continue with independent 
increments. This method will work since after ujm steps the walk has reached 
height approximately Xum and from here, whp, will remain positive. The 
following construction makes this precise. Recall S'm = 1 + XljLi where 
F, = D, - 1. 

Contruction 3.9. Let {SI, . . . , S^^} ~ {Si, . . . , S^JSi > O^i < Um}- Fur- 
thermore, let K^m+i) • • • be i.i.d. random variables with distribution 
Poi{X + l) — l, independent of {S^, . . . , S*^J. Define the sequence Ri, . . . , Rm 
by 

The following Proposition states that the processes and {Sj\ii : 

Sj > 0}^^i can be coupled whp., i.e. they are equal whp. Furthermore, 
we can derive good bounds for {Rj}JLi from which we can obtain the right 

magnitude of Y.T=i ^ and Yl^i ^ {} ' ^) foraiula (Q. 
Proposition 3.10. 

(a) 

P {Ri > 0, . . . , Rm > 0) = I - o(l) as m -> oo, 
and there exists a coupling such that 

P (jij = Sj, I < j < rnj = 1 — o(l) as m ^ oo, 

where {Si, Sm} ~ {5"!, . . . , Sm\Si > OWi <m}. 
(b ) There exists a constant C > such that, whp, for all j > log log m 



Xj - C^/j log J < S, < Xj + C^/j log J. (13) 
(c) Let Pi = Di/Si, then 

A + 1 

^Pi(l - Pi) = —^\ogm + Op(loglogm). (14) 
i=i ^ 
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Proof of Proposition 3.10 . With slightly different notation, the proof of this 
lemma can be adapted word-by-word to the proof in [4, Proposition 3.7], since 
the increments are i.i.d. Poi(A + 1) — 1, so the same random walk argument 
and coupling works. For part (a), the rough idea is that if the random walk 
stays positive up to time a;„, then it will be with high probability around 
its expectation, so from here the event of hitting zero has exponentially 
small probability. For part (b), large deviation for sums of Poi variables and 
union bound works. For part (c), use the estimate in (b) to substitute the 
denominator by its magnitude and use Markov's and Chebyshev's inequalities 
and some analysis to estimate the error terms. □ 

Let us put all these preparations in the right order to prove Theorem 3.4 

We dealt with the technical issue of conditioning on 
In part (c) of Proposition 3.10 we saw that 



Proof of Theorem 3.4 



survival in Construction 3.9 



both the conditional expectation and variance of Gm are indeed equal to 
(A -|- 1)/A ■ logm. We argued that it is enough to show that the Wasserstein 
distance between the standardization of G,„ and that of a standard normal 



variable tends to zero. According to Lemma |3.8| the Wasserstein distance 
tends to zero if YlT=i Pj(^ ~ Pi) — ^ oo as m — )■ oo. But this is immediate 
from Proposition 3.10 part (a) and (c). Thus (|5]) is proved. 

The almost sure convergence in ([6]) is basically a restatement of Theorem 
3.6 part (iv). □ 



Remark 3.11. Note that Assumption \3.1\ was only needed for the conver- 
gence of Gm o-nd not for Am. 



4 Embedding of BP into IHRG 

This section relates the exploration process of the neighborhood of a vertex 
in the inhomogeneous random graph (IHRG) model (j(n, k) of Section [2] and 
the branching process of Section [3] to one another. 

This is done by first showing that a multi-type BP with binomially dis- 
tributed offspring can be coupled to a BP with proper Poisson offspring 



distribution under Assumption |3.1[ As a result Theorem |3.4| also holds for 
the BP with binomial offspring distribution. To obtain an embedding of 
the BP into the IHRG, we give the vertices marks to get a continuous-time 
marked BP (CTMBP) on which we define thinning. This procedure basically 
deals with the problem of finding the shortest weight path amongst multiple 
possible paths between any two given vertices. 
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In Subsection |4.3| we analyze the connection time of the two explorations 
from vertices x and y and establish the relation between the hopcount and 
shortest weight path in the IHRG and the quantities Gm and already 
studied in the CTMBP. 

4.1 Marked branching processes 

Let us define a continuous-time multi-type BP with binomially distributed 
offspring and simultaneously recall the construction of the Poisson coun- 
terpart introduced in Section [3| Recall that the set of different types is 
S = {1,2, ... ,r} equipped with a probability measure fi and n = Xlte^ ^t, 
where Ut satisfies ([T]) for every t E S. When a particle of type s dies it gives 
birth to a random number of offspring r/^t and C,st of type t in the binomial 
and Poisson BP respectively, where 



d 



k{s, t) 



7] St = Bin ytit - Sst, — - — J , and 

^st = Poi (Xst) , where Xst = n{s,t)^f 

Each offspring lives for an exponentially distributed random time with rate 
one. Recall that {rit — 5st) ■ n{s,t)/n — > Xst- When a particle of type s dies 
its offspring distribution is a sum of independent random variables 

= Vsi + Vs2 + ■ ■ ■ + Vsr, and 
-Dp^ = isi + ^s2 + ■ ■ ■ + isr respectively. 



Conditioned on the event that the j-th split is of a type s vertex, the dis- 

:>f and 



tribution of the increment of the process Df and is equal to D^^^ and 



(s) 

Dp respectively. For the number of alive vertices after the m-th split we 
write S^, = Df + ... + D^,~ (m-l) and S^ = D[ + ... + D^- {m- 1). 
Due to the memoryless property of the exponential distribution, the next 
vertex (m -|- 1-st) to split is uniformly distributed among the and Sj^ 
alive vertices. 

Under Assumption 3.1 Dp^ is equal in distribution to a Poi(A + 1) random 



variable, so in distribution this is indeed the same BP as the one defined in 
Section |3| We need to make a distinction here with the different types, 
because shortly we will give the vertices marks according to their type. But 



let us first show that the results of Theorem |3 .4| carry through to the binomial 
BP. 

Proposition 4.1 (Asymptotics for binomial BP). Consider the sequence 
, D2 . . ., defined above. Let Gm and Am be defined as in Proposition 3.1 . 
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Then, conditionally on > for every 1 < j < m, the asymptotics for Gm 
and Am in Theorem 3.4 remain to hold, where the limits are independent. 

Proof. It is well known that the random variables rjst and ^st can be coupled 
for every s,t G S, by means of a pair of variables {fjst, ^st) (with rjst ~ rjst and 
^st ~ (,st) so that for n large enough {\nt/n — fit\ ^ £ holds) we have 



XstK,{s,t) + o(l) 



n 



n 



Similarly we can couple D^^^ and Dp^ to obtain a uniform bound for 



n 



n 



Hence for a single increment of the process 

P [Df + bf) = ^P [b^§ + bp) P (type-, splits) < 



A + 1 



max K 



n 



c 
n 



Thus for any m = o(?2), we can couple the random variables , . . . , and 
, . . . , D^, such that 



c 
n 



772 

c 7-0, as 77 — !■ oo. (15) 

77 



^j<m:bf^b^)<Y, 

i=i 

Our aim is to couple and G^, the generation of the 7n-th split in the 
two processes. So, we show how to couple the Bernoulli random variables 
(/f , /J'), 1 < j < 771, conditioned on (-Df , Df ), . . . , (D^5 -^m) defined in 
( |lO) ). Let t/i, . . . , be an i.i.d. sequence of uniform (0, 1) random variables 
and define the conditional probabilities 



P (^f = 1 1 u,) = 1 



Df 

J cB 



P (^f = 1 1 u,) = 1 



Df 

. 5; 



, (16) 



where 5f = Df + . . . + Df - (j - 1), 5f = Df + . . . + D^ (j - 1) and 
1 [■] is the indicator function. Note that by the coupling in ( |15[ ) 

P (^Sf > 0, 1 < j < 771^ > for every m, 

since the Poisson BP survives with positive probability. The joint distribution 



of {If, /f ) is given by (|T6]). Let Gi 



/f + + and G^ = lf + ... + l 



By the coupling of {If^lf) we can see that for m = o{ 



77 



P(G^^Ga<p(3,<772:Df ^Df)f c^^O. 
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Combining this with Proposition 3.7 part (a) for G^, it follows that the 
asymptotics also hold for G^. We conclude that the asymptotics for can 
be proved in a similar manner. □ 

The next step is to describe continuous-time marked branching processes 
(CTMBP). By assigning marks to the individuals in the BP according to their 
type, we will be able to relate them to the vertices during an exploration of 
the IHRG. Fix n > 1 and denote the set of marks by [n] = {1, 2, . . . , ra}. It 
is important to distinguish the marks according to the types, so [n] is the 
disjoint union of the sets of marks [n]^^\ . . . , [n]^^\ where there are tit different 
marks in [n] . An individual of type s will be assigned a mark from [n] '•'^^ . We 
consider the same continuous-time BP with binomially distributed offspring 
defined earlier in the subsection. 

The construction goes as follows. Assume that the root of the BP is of 
type s. Assign the mark zq ^ [ri]^^-* to it. The root immediately dies and 
gives birth to a random number rist of type t offspring. For t s, the rjst 
"new" individuals of type t are assigned different marks from [n]^^^ uniformly 
at random. For t = s,we choose the marks the same way from [n]''*^\zo- Each 
offspring is assigned an Exp(l) variable corresponding to its edge weight. 

Inductively, the next individual to split after Tj time units can be uni- 
formly chosen from the alive vertices (by the memoryless property of the 
exponential distribution). Assume it is of type s with mark ij. It gives birth 
to r]st offspring of type t and choose r]gt different marks from [n]^*^ for t ^ s 
and T]ss different marks from [n]^^\ij for t = s. We update edge weights 
of the "old" vertices by subtracting Tj from them or just simply indepen- 
dently draw a new Exp(l) variable for each alive vertex, corresponding to its 
remaining lifetime. 

For a moment let us think in terms of graphs. By assigning the marks 
this way, we rule out the possibility of loop edges and multiple edges between 
vertices during an exploration of the IHRG. There still remains the problem 
of picking the shortest weight path amongst multiple possible paths between 
any two given vertices, i.e we have to handle cycles. 
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Marks of siblings are 
different 

Parent-offspring 
marks are different 




9 



Marked BP 



4.2 Thinning branching processes 

We introduce thinning on branching processes to identify the shortest weight 
path between any two given vertices in a very natural way. Think of a cycle in 
a labeled graph with edge weights (assume there are no other cycles within). 
There are two paths between any two vertices x and y on this path, but 
we are only interested in the shorter one. Let fluid percolate through the 
graph at a constant rate and keep track of the labels that we reach. From x 
the fluid obviously reaches y along the shorter path first and the other flow 
reaches y exactly when the mark of y reappears on our list of marks already 
reached. Thus for any shortest weight path between x and y the second path 
will not be used. 

In terms of the BP this means that when we reach a mark 

ik e {io,ii, . . .,ik-i}, 

i.e. we found a cycle in the exploration of the IHRG, we delete ik and the 
whole subtree starting from it. We call the mark ik thinned and the marks 
of its subtree truncated. As a result we only keep the shortest weight paths 
between pairs of vertices. We refer to this procedure as thinning the BP. 
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Thinned CTMBP 



^ thinned vertex 



• Of the vertices with 
the same mark, only 
keep the one closest 
to the root. 



For each fixed n, we choose marks from a finite set [n] , so with probability 
one at a certain random time all marks will have appeared. This means that 
the thinned CTMBP dies out, i.e. we found the minimal weight spanning tree 
of a component of the IHRG. It is clear that for each t > 0, the set of marks 
reached by time t in the thinned CTMBP and the set of vertices reached by 
time t in the IHRG are equal in distribution. We successfully embedded the 
thinned CTMBP into the IHRG, so we immediately get the following: 

Lemma 4.2 (FPP on IHRG is thinned CTMBP). For any fixed n > 1, 
consider the thinned CTMBP and IHRG as defined above. Then for any iq G 
[n], the weight Wn{io,j) of the shortest weight path between initial vertex 
and any other vertex j G [n] in the IHRG is equal in distribution to the weight 
of the shortest weight path between the root iq and j in the thinned CTMBP. 
Furthermore, the hopcount between io and j is also equal in distribution in 
the IHRG and the thinned CTMBP. 

Remark 4.3. 

(a) If iQ and j are not connected then we simply put the weight and hopcount 
equal to +oo. 

(b) We did not use that the edge weights are exponentially distributed. So the 
lemma holds for i.i.d. edge weights with arbitrary continuous distribution 
supported on (0, oo). 
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What remains is to make the intuitive picture of colhding flows of fluid 
precise. In the sketch of proof we mentioned that we cannot follow the 
progress of a flow continuously in time. Rather, we will look at the picture 
at each split time Ak, which means that we let the fluid percolate on each 
edge of the CTMBP or equivalently in the exploration process of the IHRG 
up until time Ak- 

For this we formally introduce the notion of shortest- weight trees SIVT^, 
for k > 1. Immediately after time Aj., the number of individuals that the 
percolating fluid already reached, the wet individuals (i.e. the ones that split) 
is + 1, whose marks we denoted by zq, ^i, . . . , ik- Let us denote this set by 
for all k > 1. We also keep track of the marks of the alive vertices, 
M^, i.e. the individuals who are the offspring of a wet vertex but the fluid 
has not reached them yet. Clearly \M^\ = Sk- Let SWTk be the collection 
of the wet and alive marks together with the split times Aq, . . . , Ak- So 
SWTo = {{io},Ao = 0), and 

SWTk = (M,-, M+, {Ao, Ai, . . . , Ak}) ,k>l. (17) 

The CTMBP can be uniquely reconstructed from the sequence {SWTk)'^^i. 
Note that SWTk contains all the marks in the CTMBP, also the thinned and 
truncated marks and possibly multiple marks among alive vertices. Later we 
will need an upper bound on the expected number of thinned marks (i.e. the 
number of cycles we find in the exploration process up to and including time 
Ak) together with the truncated marks. The proof given here is different 
from the proof of the corresponding lemma [H Lemma 4.5]. 

Lemma 4.4 (Expected number of thinned and truncated marks). Fix k > 1 

and denote by MT^ the number of thinned and truncated marks in , and 
similarly ATk stands for the truncated alive marks in . Then 

E(MT,-+AT,)<5^-ii±^. (18) 

Proof. We calculate the size of the subtrees of thinned vertices such that a 
subtree includes all the wet and alive vertices. Note that this way we might 
count vertices on multiple occasions, but this upper bound is still sufficient 
for us. At time Aj the vertex with mark ij splits. Denote the subtree, 
including wet and alive vertices, of ij by Fj. Then 

k 

E [MT^ + ATk) < E (|Fj| ■ 1 [ij is thinned]) . 
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Recall that N^^^ denotes the number of splits of type t vertices among the 
first k splits and there are nt different marks that correspond to vertices of 
type t. Next we estimate the probability of the event that ij is thinned: 

P {ij is thinned) = P [ij is thinned | type t) P (type t) 
tes 



ie-s ^ 



(*) 



Ut n 



n 



(19) 



where for (*) we used g, Q and (|9|. By ^ Theorems 3.22-24.] it IS easy 
to check that the error term is negligible (after summing over j). The last 



equality is a consequence of Lemma 4.5 

Next step is to determine \Fj\. By the memoryless property of the process, 
it is easy to see that the proportion of the subtree of ij compared to the size 
of the BP satisfies 

(20) 

where \F\ = Sk+i + k is the number of vertices in the entire BP including 
wet and alive vertices. 

Since \Fj \ and the event ij is thinned are independent, we find that putting 
all of these together with the help of Proposition 3.10 part (b): 

k 









\F 




j 



E (MT- + An) < 5^ E (|F, I) P {^, is thinned) <1 E (|F,|) 

k 



I 

n 



3=0 



D 



Sj+i 



{Sk+i + k) 



n 



D 



J + 1 



< E 



A(j + 1; 

Xn 



n 



O 



(Sk+i + k) 



Xk^ 



n 



Lemma 4.5. Under Assumption 3.1 we find that 



□ 



TT = /i, 

where tt is the left eigenvector corresponding to the eigenvalue X of A and fi 
is the probability distribution given on S. 
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Proof of Lemma \4.5[ Recall that Agt = n{s,t)fit — Sst- Using the symmetry 



of K and assumption (3.1), fiA = Xfi follows immediately. □ 



We show a similar result to the assertion of Lemma 14.41 for the number 
of multiple marks among M^, these are the cycles that will be found in the 
exploration process after step k. Let MT^ denote the number of multiple 
marks in M^. Instead of proving for general k, pick k = ^/n. We will see in 
the next subsection why this special case is important. 

Lemma 4.6. Let an = ■ Then MT^ = Op(l) , i.e. the sequence MT^ 
is tight. 

Proof. When assigning marks to new vertices of type t we prescribed some 
constraints on the set [n]*^*^ from which we choose its mark. We can dominate 
this by repeatedly choosing from without any constraints. From ([t]) we 
know that Sa} = A7rta„(l + o(l)). So let us sample Xirtanil + o{l)) marks 
from [n]^*-* with replacement. For i E [n]^^^ let Xi be the number of times i 
was chosen, thus Xi ~ Bin(A7rta„(l + o(l)), l/ut). The probability that i is 
chosen at least twice is 



P(X, >2) = l-P(Xi = 0)-P(X, = l) = ^-f + 0. 3 



Thus for the expected number of multiple marks 



2 nf \ni 



E Y: 1[^.>2] = E P(X.> 2) = 1^^ + 0(1). 

Thus the expected number of multiple marks is 

E (MT+) =y±lL^ + 0(1) = + 0(1), 



where Lemma 4.5 was also used. Now if a„ = y/n, then Markov's inequality 



implies the tightness. □ 



4.3 Connection time 

It was already mentioned in the sketch of proof that to determine the shortest- 
weight path between two vertices x and y, we let fluid percolate from each 
one and stop when the two flows collide. For technical reasons we don't let 
the fluids flow simultaneously from both vertices, rather we let the fluid flow 
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from X until it reaches some a„ vertices and then start a flow from y until 
the random time C„, the connection time, when the two flows collide. To 
be able to make this precise, we define the connection with the help of the 



shortest-weight trees defined in (17). 

Recall that SWTa^^ after the split time Aa„ is the collection of wet marks 
Ma^^ , alive marks M^^^^ and the split times A^q\ A^^\ . . . , A^a] ■ Then, we 
start another marked branching process from root y. Since the flow from 
y can only connect to the flow from x via an alive vertex in SWTa^\ a 
connection is possible only when a mark from M^^^^ appears among the 
marks in Mjf^~ . So we have to make a slight modification in the way we 
assign marks to the vertices in the BP from y. The procedure described in 



Ma^^ from the possible marks [n]. 

From here we define the connection time 



Subsection 4.1 remains except that from the start we leave out the marks 



= mm{k > : Mi^)+ n Mjf>' ^ 0}. (21) 

Note that the final edge still has to be formed. Normally, this would imply 
that when determining the distribution of the shortest-weight path we would 
have to add an extra Exp(l) random variable for the last edge. However, 
with some reasoning we will see that the distribution of this last edge is 
actually the same as letting the fluid flow from x until the next split. The 
figure below illustrates the connection time. 




First we present a series of lemmas and we give proofs afterwards. The next 
lemma describes the asymptotic distribution of the connection time. 
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Lemma 4.7 (Weak convergence of connection time). Conditioned on the 
event that both CTMBPs survive, the connection time C„ satisfies the asymp- 
totics 



n 



where E has an exponential distribution with rate ^Ylses'^s/ f^s' which 
plifies to rate A under Assumption 



sim- 



3.1 



This result supports that the intuitive picture of letting the fluids flow 
simultaneously or delaying one of them indeed makes no difference in terms 
of finding the shortest path. 

We want to show that when the connection happens, the probability that 
a type t vertex splits in SWT'^'^^ is the same as the probability that the next 
vertex to split in SWT'^^'> is of type t. So choosing the mark from M'tJ^ at 
the connection is the same as choosing the next vertex to split in SWT^^^ 
from the same set. This would allow us to say that the last edge forms simply 
by letting SWT'^^'^ flow until the next split a„ + 1. 

In Lemma [4. 4| and 4.6 we proved that the number of truncated and mul- 



tiple marks in is of constant order. So we have for a„ = y/n 

q{t) 

PW^a„ + l)=t) = ^ = ^t(l + 0(l)). 

The next lemma shows that when the connection happens the probability 
that a type t vertex splits in SWT'^'^'^ is not ttj in general, but under Assump- 



tion 3.1 the two probabilities are equal. We believe that the corresponding 
arguments in [1] require some correction, so the proof given here is new and 
uses other methods. 

Lemma 4.8. In the limit n — )■ oo, the probability that, when the connection 
happens, the vertex to split is of type t satisfies 

/ I X vr?/ lit 

P [type t splits | connection) 



Proof of Lemma |^. 7[ Let us first show the proof neglecting error terms and 
then we will make the arguments rigorous. For the probability P (C^ > m) 
we can write 

m 

P (C„ > m) = J]E (P (C„ > J I C„ > J - 1, , (23) 
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where J^j-i is the cr- algebra generated by SV/T^^^ and SWT^''^\. To calculate 
P [Cn > j I Cn > j — I, J^j-i) we have to sum over the types in SWT'^^'^ and 
find the probability that it does not connect to an alive vertex in SWT^^^ of 
the same type. Formally, 



P(C„>j|C„>j-l,j;_i) =5^ 



s. 



(y) (t) 



civ) 
tes '-'j-i 



q{x) (t) ~ 

nt 



Plugging this into (|23|) with m 

1^ \ 



p a > 



xn 



J 

xn/an 

n 




\ 2 

n ^ lit 



exp 



ret 



/it 




In order to have short formulas we neglected the error terms along the lines 
but using Theorems 3.22-24.] it is easy to see that the dominating error 
terms are of order ^^e(^2)/A-i ^-3/2 (depending on whether Re{\2) > |A 

or Re{X2) > ^A, where A2 denotes the eigenvalue of A with the second 
largest real part. Lemma 4^ implies that under Assumption 3A_ we have 

Ete5^tM = l- □ 

Proof of Lemma \4-S[ We neglect the error terms along the lines again but 
note that the very same estimates work as in the proof of Lemma |4.7[ 

Let us introduce the events C := {connection happens} and tf be the 
event that a type t vertex splits in SWT^^^ when the connection happens. 
To form the connection, a vertex of some type t splits among the alive vertices 
in SWT^y'' and connects to an alive vertex of type t in SWT^^\ Thus 



(x) (t) g(y) (t) 



S. 



(y) 



-(1 + 0(1)). 



Using a simple Bayes' theorem we conclude that 

p(c|tt)p(n) _ 



n 



P{t]\C) 



>Ja„ I lit ■ I '^C„ 



P(C) 

A • ir^/fit ■ an/n 
AE.e5 ^s/f^s ■ an/n 



c(^0 (s) /„ qiy) (s) / q(y) 

>Ja„ I lis ■ ^C-n I "^Cn 



'^seS 
(1 + 0(1))^ 



TTiV/it 



□ 
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We are now ready to prove the main theorem of this section, which makes 
the idea outhned in the sketch of proof precise. 



Theorem 4.9 (Connecting hopcount to heights in CTMBP). Let Hn and 

Wn denote the hopcount and the weight, respectively, of the minimal path 
between vertices x and y in the IHRG G{K,n), where we condition on the 
vertices x and y to be in the giant component. 



(a) For n — )■ oo, whp, 



ff„^Gi:Vi + Gg, (24) 



W^n = 4:Vl + ^£, (25) 

Proof. In SWT'^^^ the mark to spht at time A^a} is i^a} , while in SWT'^y^ the 
mark to spht at aI^^ is This is the same mark that is ahve in SWT^^^ 

at A^a} 1 to make a distinction denote it by i']^\ 

Since x and y are connected, whp, x and y are contained in the giant 
component of the IHRG, which implies that both CTMBPs do not die out. 



The main contribution of Lemma 4.8 and the argument preceding it is that 



when the connection happens we choose uniformly from the alive vertices 
in SWT^^'^ which in distribution is the same as Ga„+i and the split time in 
distribution is Aa„+i. 



From Lemma 4.4 and 4.6 it follows that the probability that the shortest 

{x) 



weight path from x to i']^' contains a truncated or multiple mark is at most 



^{ATa +MT+) 




This shows that whp Ga„+i gives the hopcount between x and i^^\ Similar 

Ay) 



(v) 

argument shows that, whp, the hopcount between y and is equal in 



(v) A k 

distribution to G(j . This estabhshes (24). 



The argument for part (b) is simple. The minimal weight path is the 
route X — )■ i^^ — )■ Iq^ — t- y. The weight on this path is 



Wr, = W{x ^ ^'S:J + W{^^^l ^ zg) + W{^'^l ^ y) ^ ^^Vi + 



where we can simply reallocate the weig ht A^ ^ W{x — )■ i^a}) over to 
X — )■ i^j due to our construction and the weight of the last edge W{i^^^^ — )■ i[^^) 
can be merged into ^i^+i- □ 
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5 Proof of main results 



In this section we prove our main results stated in Section [2j The proofs are 
analogous to the ones in 



Proof of Theorem 2^. Observe that, conditioned on C„ the random variables 
G^] and G^^ are independent. Introduce the abbreviation /3 := (A + 1)/A 
and define the random variables 



G^a} - (3 log an 
V/3 log a„ 



Ay) .._ 



Gg-/31ogC„ 
V/31ogC„ 



and denote bv f i . .^2 two independent standard normal random variables. By 



Proposition 



4.1 



we know that Zn"^ converges in distribution to .^1 and zj^^ 
converges to ^2 conditioning on the event that {C„ = m^}, where m„ = o{n). 
Formulating this with the Wasserstein distance (introduced in Subsection 

,bles 

(26) 



3.2 ), we find that for any bounded continuous function g of two real variables 
hm E (g{Zi^\ Z^^l) \ C„ = m„) = E (^7(^1, 6)) • 



Hence by bounded convergence and using ( 22 ) and ( 26 ) , 



ny 



limE (g{Z^:\ Zg))) = ThmE (g{Zl:\ zg) | C^- 
= E((7(ei,6)) 



dP (a< 



ny 



which shows the joint weak convergence of the pair (Z„ , Z^ ) to (.^i, ^2)- By 
the continuous mapping theorem P Theorem 5.1] applied to (x, x + y 
and by Theorem 4^, we find that 

ff„ - /3 log n _ - (/3/2) log n ^ - (/3/2) log n 



log n 



\//31ogn 



^(3 log n 



log n 




and we note that (^1 + .^2) / "\/2 is again standard normal. When switching 
from (/3/2) logn to /31ogC„, we suppressed the term (/31ogi?)/-\/logn (where 

E = Exp(A)), since it is op(l). Note that the limit is independent of C.„ as 
well. □ 
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Proof of Theorem \2.4\ Similarly to the proof of Theorem|2.3[ by Proposition 



|4.1|for any sequence m„ — )■ oo, 



- iloga„, -hogCn I Cn = m?j ^ (Xi,X2 



(27) 



where Xi and X2 are two independent copies of X = — i log (^^W^j • Combin- 
ing Theorem 4.9 , Lemma 4.8 with (27) and the continuous mapping theorem, 
we obtain 

iy„ - i logn = - i loga„ + - i logC„ + i log ^ , 
A A A A '^n 



>X2 



Al/AlogE 



where E = E xp(A) is independent of (Xi, X2). Thus the distribution of W in 
Theorem 2.4 is the sum Xi +X2 — 1/XY, where Y has a Gumbel distribution 
with distribution function exp ( — e~^^) . □ 



Proof of Corollary 2J). It is immediate from the two previous proofs with 



A„ + 1 A + 1 
> — c— 



An 



A 



A 1 1 

and ^ y ^ 

An A 



as ri — )■ cxD. 



□ 



Proof of Theorem \2.T\ In the dense setting, where A — )■ 00, we have a se- 
quence of kernels /«n(s, t), n = 1, 2, . . . and s,t E S. The type t neighbors of 

a type s vertex have distribution rj^J^^ = Bin{nt — Sst, i^n{s, t)/n). We avoid the 



coupling with Poisson random variables in Proposition by directly giving 
a proof for the CTMBP where the offspring distribution {Df^^ltjpe s splits) 
is the sum of independent binomial random variables 77^"'* , t G S (for details 



see Subsection 4.1) 



To prove part (a), concerning the hopcount on the shortest-weight path, 
it is enough to show that ^ holds with (A + 1)/A replaced by (An + 1)/An, 
since from here we can argue the same way as we did in the sparse setting. 
Let . . . , i^'-^'', ^1, '^2; • • • denote independent standard normal random vari- 
ables. By the CLT we get that 



(2;f)|type5splits)^5^( 
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From here, with the usual notation S'^"'' = Yl]=i ~ ~ 



Therefore 



1 + , . (28) 



Combining Proposition 3.7 with (28), we find that Gm satisfies the same 
asymptotics as 



where P (jf ^ = 1 

i=l 

with Jg"'', . . . , Im^ independent. Applying Lemma 
for any m = rrin — oo. 



A„ + l 



we conclude that 



3.8 



where Z is standard normal and /3„ = (A„+l) / A„. Note that the denominator 
can be replaced with -y/logm, since 

f;y-(l-y)=/3nlogm + 0(l). 
i=i ^ ^ 

The centering constant PnYll^=i 1/^ "^^^ ^e replaced by /3„logm, which es- 
sentially completes the proof of part (a). We can further argue that the cen- 
tering constant can be replaced by logm if and only if (/3„ — 1) A/log m — )■ 0, 
or equivalently v^log n = o{Xn). 

Let us continue with the proof of part (b), concerning the min ima l weight. 
We again use that S'l"'' concentrates around iXn + 1- Proposition 3.7 part (b) 
states that 

Notice that the sequence Em/m, Em-i/m — 1, ... , Ei/1 gives in distribution 
the spacings of the exponential random variables Ei, . . . So the sum 
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YllLi Ei/i is equal in distribution to a random variable Bm that is the maxi- 
mum of m independent exponentially distributed random variables with rate 
1. The distribution function of 3^ is 

From here we find that 
P (aa£) - loga„ < x) = P < X + loga„) = (l 



-(x+loga„)^ 



e ' 

= exp (-e""^ + 0(l/a„)) ^ exp (-e""') = A(a;), 

where A denotes the distribution function of a standard Gumbel random 
variable. Similarly to the proof of Theorem 2A~ we conclude that 

(\A^:} - log a„, AAg - log Cn) ^ (Fi, F2), 

where Yi and I2 are two independent copies of a standard Gumbel random 
variable. From Theorem 4.9| we get that by the continuous mapping theorem 

XWn - log n = \A^:} - log a„ + A^g - log C„ + log - log a„, 

V ' ^ V ' ' ^'• ' 

where Is is an independent G umb el random variable with distribution func- 
tion exp ( — e~'^^) by Lemma 4.7 □ 



6 Extending to the general model 

In this section the general model of [7] is described. To extend our results 
to this setting, following the lines of [7j, general kernel functions are approx- 
imated by a sequence of finite-type kernels already studied in detail in the 
previous sections. 

In the general setting 5 is a separable metric space equipped with a 
Borel probability measure /i. For each n we have a deterministic or random 
sample of n points x„ = (xi, . . . , x„) from S. We assume that the empirical 
distribution 




i=l 



converges in probability to /i as n — )■ 00, where 6x is the measure consisting 
of a point mass of weight 1 at x. This convergence condition is equivalent to 
the condition 

^n(5):=^^^^^^-^M5), (29) 
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for every /i-continuity set S G S (i.e. S is measurable and there is no mass 
on the boundary of S). This is the analog of condition ([T|. 

The pair {S, /i) is called a ground space and for a sequence (x„)„>i satis- 



fying (29) we say that the triplet (iS,/x, (x„)„>i) defines a vertex space v. A 
(generalized) kernel k on a ground space is a symmetric non-negative mea- 
surable function on S x S. From here the random graph G{n, k) is defined 
the same way as in the finite-type case. For a given kernel k and vertex space 
ly, an edge {ij} (with i ^ j) exists with probability 

P.,:=min|^^^^,l}. (30) 

Independently each edge is given an Exp(l) edge weight. 
Remark 6.1. 

(a) Kernel n is regular finitary ^ Definition 4-4] ^/'^ has a finite partition 
into (fi- continuity) sets Si, . . . ,Sr such that k, is constant on each Si x Sj 
for all 1 < i,J < r. Clearly, a random graph G{n,K,) generated by a 
regular finitary kernel has the same distribution as a finite-type graph. 

(b) Note that measure zero sets cannot be ignored the usual way, since chang- 
ing K on a set of measure zero may have a significant effect on the graph 
G{n, k). 

The model is too general thus far, so further restrictions are needed for 
the kernel k. Some technical conditions are needed and the total number 
of edges e{G) need to "behave the right way". These are collected in the 
following definition, from [7]. 

Definition 6.2. A kernel k is graphical on a vertex space (5,/x, (x„)„>i) if it 
is continuous almost everywhere (a.e.) and in L^{S x S, fi x fj,), furthermore 



-E {e{G{n, k))) ^ \ j f k{x, y)dii{x)dii{y) . 



(31) 



Condition (31) says that G{n,K,) has about the right number of edges, 
since 



E{e{G{n,K))) = E j ^min{^ 

\i<j 



[K{xi,Xj)/n, 1} 

which tends to n| JJ k under certain well behaved cases (see [71 Lemma 8.1]). 



For example, condition (31) holds whenever n is bounded and z/ is a vertex 



space. Li and (31 ) together imply that the (expected) average degree is 0(1), 
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meaning that the results will only be extended in the sparse setting. Further 



note all the conditions of Definition 6.2 automatically hold in the finite-type 



case. To use a sequence of approximating kernels we need to let k depend 
on n. The next definition contains the conditions for a sequence of kernels. 

Definition 6.3. Let v = {S,fi, (x„)„>i) be a vertex space and n he a kernel 
on V. A sequence of kernels on {S,fi) is graphical on v with limit k, if, 
for a.e. {y,z) G S^, 

Hn y and z imply that Zn) n{y, z), 

K G Li and continuous a.e., and 

-E {e{G{n, k,,))) ^ \ I ( k{x, yW{x)My). (32) 

We also need to exclude cases where the vertex set of Cj(n, k) can be split 
into two parts so that the probability of an edge from one part to the other 
is zero. This is formalized in the following definition, which is the analog of 
Assumption 3.5 for the matrix A defined in (|4]). 



Definition 6.4. A kernel n on a ground space {S, ^) is irreducible if 

A ^ S and k = a.e. on A x {S \ A) implies fi{A) = or ^{S \A)=0. 

As a slight modification we say that k, is quasi-irreducible if there is a fi- 
continuity set 5' C 5 with fi{S') > such that the restriction of k, to S' x S' 
is irreducible and K,{x,y) =0 if x ^ S' or y ^ S' . 

Next, we describe the branching process that arises when exploring a 
component of G(n, k). It is a multi-type continuous time branching process 
with type space 5, where a particle of type x E when it splits, gives birth 
to a set of offspring distributed as a Poisson process on S with intensity 
measure K{x,y)dfi{y). That is, the number of children with types in a subset 
S (Z S has a Poisson distribution with mean K{x,y)dfi{y). Each offspring 
lives for an Exp(l) amount of time independently of each other. 

We also define an integral operator, whose norm establishes a direct con- 
nection with the emergence of a giant component in the random graph G{n, k) 
and the survival of the above mentioned branching process. Let be the 
integral operator on {S, fi) with kernel k defined by 

{TJ)ix):= j^Kix,y)f{y)dfi{y), (33) 
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for any measurable function / such that this integral is defined. The norm 
of is 

IIT.II :=sup{||T./||2:/>0, H/b < 1} < oo, 

where || ■ II2 is the norm of L^(iS,/i). In the finite-type case T^f = {A + I)f, 
so easy calculations give us the norm: 



IT 



\A + I\ 



A + 1, 



where the last equality holds because of Assumption |2.2[ Theorem 3.1 of 
[7] states that a giant component emerges in G{n, k„) and its corresponding 
branching process survives with positive probability if and only if ||Tk|| > 1. 
This is why we assumed throughout that A > 0. 

It only remains to formulate our main assumption in the general setting. 
Let {S, fi) be an arbitrary ground space and let k be such that it satisfies 



k(x, y)dfi{y) = A + 1 for a.e. x. 



(34) 



This means that asymptotically the average degree is independent of the type. 
We can see that in this case the norm ||Tk|| is indeed A + 1. An example for a 
kernel k satisfying (34) can be given by taking S = (0, 1] (interpreted as the 
1 dimensional torus T^), as the Lebesgue measure and k{x, y) = h{d{x, y)) 
for an even function > of period 1, where d{x,y) is the metric given on 
S. Figures [3] and |4] show the contours of two such examples, where the dark 
purple strips indicate where the contour is zero. 





Figure 3: h{z) = l[\z\ < 0.25] 



Figure 4: h{z) = 
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6.1 Approximation of kernels 

As mentioned before, the extension of the main resuhs to the general model 
will be done by constructing a sequence of partitions of S for which a sequence 



of approximations of the limiting kernel k are defined, each satisfying (34). 
The approximating kernels are regular finitary kernels, so the results of the 
previous sections apply to them (the regular finitary case and the finite-type 
case differ only in notation). The idea of using approximating kernels comes 
from [U Section 7]. 

In the regular finitary case we may assume that the type space S is 
finite. Furthermore, we can assume that fig > for every s G 5, however 
an argument is needed. We cannot simply ignore such types, since measure 
zero sets can alter G{n, k) significantly. We can argue as follows, Ij^. 

Suppose that /i^ = for some s E S. Start by redefining the kernel 
K,'{s,t) = K,'{t,s) := maxK for every t G iS and leaving it alone otherwise. 
Then define a new probability measure n' by shifting some small mass t] over 
to s from the other types. Clearly fj,^ > for every t G 5. Possibly the types 
of some of the vertices changes, so change them correspondingly. This way 
we obtain a vertex space z/' = (iS,/i', (x'„)„>i) with kernel k'. It is not hard 
to see that we can couple G{n, k) and G'{n, n') so that G{n, k) C G'{n, k!). 
Finally, letting 77 — 0, the norm of T^' with respect to yu' tends to the norm 
of with respect to /x, which is equal to the norm of T^, since n = n' a.e. 
Iterating for other measure-0 types, we can see that it suffices to consider 
cases where /xt > Vt G 5. 

We continue with the definitions of the approximating kernels. Given 
a sequence of finite partitions Vm = {^mi, • • • , ^mM^}, m > 1, of 5 and 
an a; G iS, we define im{x) as the element of Vm in which x falls, formally 
X G Am,i^{x}- As usual, diam(y4) denotes sup{d{x,y) : x,y E A} for A C S, 
where d is the metric on our metric space S. Lemma 7.1 of [7] states that 
for any ground space (5, fj,) there exists a sequence of finite partitions of S 
such that 

1) each Ami is a /^-continuity set, 

2) for each m, Vm.+i refines Vm, 

3) for a.e. x G iS, diam{AmA^{x)) — )• 0, as m — )■ 00. 

For such a sequence of partitions we can define a sequence of approxima- 
tions of K by taking its average on each Ami x Amj'- 



Kmix,y):=—- ^ — r // K(s,t)d/i(s)d/i(t). (35) 
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If K is continuous a.e. then property [3j) implies that Km{x,y) — > K,{x,y) for 
a.e. every (x, y) G S^. To be able to apply our theorems for finite-type kernels 



we need to guarantee that Assumption 2.2 holds for all Thus, considering 



Km as a finite-type kernel with respect to the partition {Ami, • • • , AmMm)i the 
row sums of Rm weighted by /x must be equal to some constant Cm- In fact 
easy calculations show that independently of the partition sequence or the 



ground space, only using assumption (34) and the fact that k is symmetric, 
this holds with Cm = A + 1. 



6.2 Extension of Theorems 12.31 and 12.4 



We have made all the necessary preparations to extend the results of Theo- 
rems 



2.3 and 'lA to the general model described in this section. Recall that 
Wn denotes the weight of the shortest-weight path between two uniformly 
chosen vertices in the giant component, while if„ denotes the hopcount (num- 
ber of edges) on this path. 

Theorem 6.5 (Hopcount & Shortest weight in general setting). Let (5, /i) he 

an arbitrary ground space and k he a uniformly continuous, quasi-irreducible, 
graphical kernel on {S, /i) that satisfies sup k(x, y) < oo and 

J k{x, y)dfi{y) = A + 1 < oo for a.e. x E S. 

Then the hopcount Hn between two uniformly chosen vertices, conditioned on 
being connected, satisfies a central limit theorem of the form 

^ ^) 

^logn 

where Z is a standard normal variable. Furthermore, there exists a non- 
degenerate real valued random variable W , such that the minimal weight Wn 
satisfies 

Wn - ~ log n W, 
A 

where the distribution of W can be determined precisely from the behavior of 
the multi-type branching process that arises when exploring a component of 
G{n, k) 

Proof. Let (iS, /x) be an arbitrary ground space and kernel k satisfies the 
conditions of the theorem. These define the sequence of random graphs 
(G(n, K))n>i- Take any sequence of finite partitions Vm = {Ami, ■ ■ ■ , AmM^}, 
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m > 1, that satisfy properties [T]) , [2]) and [3]) described in Subsection 6.1 For 
each m, consider the finite type approximating kernel defined in (35), 
with ground space {Sm, fJ') (where \Sm\ = M„ 



As a result we obtain the 



sequence (G(n, Km))n,m>i- Note that in the proofs for finite type kernels none 
of the estimates depend on /i^ or the cardinality of Sm, so all the error terms 
are uniform. The condition svLpn{x,y) < oo is necessary because it is used 



in the proof of Proposition 4.1 



To prove the results we let n and m tend to oo simultaneously in a 
carefully chosen way. For fixed m, Theorem 2.3 states that 



hm P 

n— >oo 



HI: 



A+l 



i- logn 

A 



^±1 logn 

A 



< X 



where $(x) is the distribution function of a standard normal random variable. 
On the other hand, letting m depend on n, from the error estimate given in 
Wasserstein metric in Lemma 3.8 it is easy to see that 



rrm(n) _ A+l 
-n.n 



logn 

A 



^logn 

A 



< X 



$(xl 



< 



ogn 



(36) 



where C(A) is a A-dependent constant. 

To be able to couple the graphs G{n, k) and G{n, k^), we need a relation 
between n{x,y) and Rmix,y). Since n is uniformly continuous, for {u,v) G 

Rm{u,v) < K{x,y){l ± Em) if \u - x\ < 5^ and \v ~ y\ < 6m, 

where diam{Am,i^(x)) < Sm and diam{Am,i^{y)) < Sm- Abbreviate Am,i^{x) 
by Amx- In G{n, k) the probability that there exists an edge between two 
vertices of types x and y is K{x,y)/n. While in G{n,Rm), between types 
Amx and Amy this probability is Rm{x,y)/n, which we can now replace by 
K{x,y){l ± Em)- Thus 



P (1 [{x, y} e e{G{n, k))] 7^ 1 [{Amx, Amy} e e{G{n, Rm))]) < 



2Er 



n 



Summing over all possible edges, we find for the edge sets that 

2n'^Em 

P (e(G'(n, k)) 7^ e(G(n, Rm))) < - " = riEr 



2n 



(37) 
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Now fix m. The 5m and uniform continuity of k defines Em- Let 

A^o,^ := max i^N : e^A^v^logiV < l| . 

Clearly for all n < iVo,m, the coupling between G(n, Rm) and G{n, k) fails 
only with probability less than l/\/logn. On the other hand, for an n we 
can pick a large enough m which approximates G{n,K) well, namely 

m{n) := inf {m : A^o,m > . 

Now we can couple G{n,n) to G{n,Km{n)) so that the coupling fails with 
probability less than l/^/[ogn. Under the coupling, also for the hopcount 
we have 

P{Hr,^Hf '^^) < l/v^ = o(l). 



Combining this error bound with the one in (36) we obtain that 



^ < X 

^logn 

A 

/ jjm{n) _ A+1 1 

= P " " <x\H^ = H'f^'^ I (1 - 0(1)) + P ^ if^^")) 

= ^{x) [l-Oi^^\ogn +C(A)/v/k^+ l/v^k^. 

Finally letting n — )■ oo (thus m{n) — > cxd also), we obtain the desired result 
for the hopcount. 

Now we turn to the proof of the convergence of the shortest weight path. 
To avoid conflicting notation we will denote Pn{i^) the shortest weight path 
belonging to G{n,K). We can use the same coupling argument as for the 
hopcount to get the estimate 

P (P„(k) 7^ P„(/tm(n))) < 



\ogn 

We know from the finite type case that 

where = 1,2 is i.i.d. random variables, distributed as — i log{\~^Wm) 

and X3 is a Gumbel variable with a A-dependent parameter. Thus for the 
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weak convergence of P„(/t) it is enough to show that 
or equivalently, we need to show that 

To see this we use triangle inequahty for some k 



1,2, 



e ^^''■^m(n)(TA:) 



\Wm(n) -W\< \Wm{n) " 

+ |e~^^'=(Zm(„)(rfc) - Z{Tk 
+ |e"^^'=Z(r,)-iy|, 



(38) 
(39) 
(40) 



where Z{t) and ^m(n) denotes the continuous time branching processes re- 
lated to kernels n and Rra(n)^ respectively, and stands for the /c-th split in 
these processes. 



To handle the second term (39) we have to pick k < y/n to be able to 



use the coupling of the graphs and branching processes and from (37) we see 
that the probability that they differ is of order 1/logn. 

To handle the first and third term first observe that from [U Theorem 2., 
Chapter V. 7.] we know that the distribution of W'^^\ the limit process of 
the martingale e~^^Z{t) started at a single type-s particle, is not depending 
on the type s E S under Assumption 3.1, We also have E {W'^^'>^ = f ^ = 1 
where w = 1 is the right eigenvector of the matrix A. We will use a multi- 
type extension of ^ Theorem 2., Chapter III. 10.] stating that conditioned 
on Z{t) and non-extinction, one can write the decomposition on a set of 
probability 1 

s&S 3=1 j=l 

where the wj^^-s are mutually independent for all s and j and distributed as 
W. 

Thus, we immediately get from the Central limit theorem that 



y/k 



y/k 



AiV(0,a^) 



Clearly the variance of 
know from Theorem 3.6 



;he limiting normal variable is then Ao"^. We also 
that ke'^^'' \-^W . Then, for the first term (|38| 
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we have 



1 1 



n the first term is of order Op{n ^/^). 



thus with k = 

The third term (40) can be handled similar way using that W^'^^ = W for 
all s G iS and gives an another error term of order Op{n~^^^). Combining the 
error terms yields the convergence in distribution of Wm to W thus finishing 
the proof. □ 
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